Voice-Based Control In A Media System Or Other Voice-Controllable Sound Generating System

ABSTRACT

A system for enhanced processing of voice-based signals in a voice-controllable sound-generating system (SGS) is provided. An SGS audio source may communicate electronic SGS audio signals to both (a) one or more speakers, which output corresponding SGS sound waves and (b) an audio countering system. A microphone may detect sound waves and output corresponding audio signals including: (a) distorted SGS audio signals corresponding with SGS sound waves and (b) additional audio signals originated from other sources, e.g., including voice-based commands. The audio countering system may (a) receive the electronic SGS audio signals from the SGS audio source; receive signals from the microphone representing the microphone-detected sound waves, and (c) use the electronic SGS audio signals received to cancel or counter the distorted SGS audio signals included in the microphone-received audio signals, to thereby enhance any voice-based commands included in the received audio signals.

RELATED PATENT APPLICATION

This application claims priority to commonly owned U.S. ProvisionalPatent Application No. 62/491,018 filed Apr. 27, 2017, which is herebyincorporated by reference in its entirety for all purposes.

TECHNICAL FIELD

The present disclosure relates to voice control of a system or device,and more particularly, for improved voice control of a system or devicethat generates sounds.

BACKGROUND

A growing number of systems are configured for voice-based control, orvoice command, of one or more aspects of such systems. These systems maybe referred to voice-controllable systems. A voice-controllable systemmay allow a user to easily control aspects of the system operation in ahands-free manner. Some example voice-controllable systems include homeappliances, mobile phones (e.g., for voice-based dialing, texting, webbrowsing, etc.), media systems (e.g., TV, stereos, etc.), computeroperating systems, commercial software for computers, internet searchengines, vehicles, and call centers. Voice control has improved inrecent years due to substantial advancements in voice recognition, e.g.,based on the advancement of deep learning generated algorithms and thedevelopment of graphics processing units (GPUs) that allow acceleratedprocessing of voice recognition algorithms.

However, for voice-controllable systems that also generate sound, suchas certain TVs and other entertainment systems, mobile phone, computers,blue tooth speakers, etc., referred to herein as voice-controllablesound generating systems (SGS), the effectiveness of thevoice-recognition system may be lessened by the fact that audio outputby the voice-controllable SGS mixes with voice command audio, and maythus mask or make the voice audio difficult to identify.

SUMMARY

Embodiments of the present disclosure provide systems and methods forimproving voice-based control of a voice-controllable sound generatingsystem (SGS) by cancelling or otherwise countering sounds output by asound source associated with the voice-controllable sound generatingsystem. As used herein, a voice-controllable sound generating systemincludes any system that (a) generates sounds and (b) is configured forvoice-based control of one or more functions of the system. Examplevoice-controllable sound generating systems include certainentertainment or media systems (e.g., a voice-controllable TV, stereosystem, smartphone, laptop computer, tablet computer, desktop computer,etc.), blue tooth speakers, intercom systems, etc.

Some embodiments provide an audio countering system configured to obtainand use electronic system-based audio signals communicated by an audiosource of the voice-controllable sound generating system to countercorresponding system-based audio signals received at a voice-controlmicrophone, after being influenced by the surrounding physicalenvironment and/or other distortion factors. By cancelling or counteringthe audio output by the sound generating system and received at themicrophone, the audio countering system can effectively isolate orenhance a voice-based signal received at the microphone simultaneouswith the sounds waves output by the sound generating system. A voicecontrol system may then analyze the resulting audio signals, with theenhanced voice-based signal (resulting from the cancellation/counteringof the sound generating system audio) to identify voice commands for thevoice-controllable sound generating system, which may then beimplemented.

In one embodiment, the system may use the same digital source to null,counterbalance, cancel, or compensate for the microphone to the soundsgenerated by the device. This may be performed in the digital domain.Furthermore, the system may compensate for system volume, speakeraberrations or anomalies, proximity to speaker, reflections, or echoes.In another embodiment, the system may use the digital source tocompensate in the analog domain. For example, the digital source may beconverted from analog to digital, the phase inversion may be matched,and gain or attenuation may be applied. In yet another embodiment, ananalog source could be converted to the digital domain andcompensated-for in the digital domain. In still yet another embodiment,an analog source could be used by matched phase inversion and attenuatedor gain applied as-needed.

The system may include a digital or analog sound source. The output ofthe sound source may be amplified with a specified gain to the speaker.In parallel, the output of the sound source may be routed to a digitalor analog compensation circuit. The compensation circuit may also haveinput from a system microphone, into which voice-activated commands areentered. The compensation circuit may perform, in the digital or analogdomain as described above, compensation for the output of the soundsource in the input from the microphone. A delay may be accounted-for,wherein the sound source may have taken an amount of time to reach themicrophone after leaving the speaker. The compensated signal may beinterpreted by the rest of the system, in which voice-activated commandsare scanned-for and recognized.

In order to compensate for sound-making devices, the system may firstperform a calibration mode. In the calibration mode, an available ordesired audio frequency band sweep may be performed, wherein the soundsource goes through such a sweep to determine ambient conditions ofinput for the microphone. Through such a process, expected delay, echo,and other signal aspects may be characterized. This characterizationinformation may be used in subsequent operating modes to compensate-forthe sound source during operation and detection of voice commands.

BRIEF DESCRIPTION OF THE DRAWINGS

Example aspects of the present disclosure are described below inconjunction with the figures, in which:

FIG. 1 illustrates an example system for cancelling or otherwisecountering the audio output by a voice-controllable sound generatingsystem (e.g., a voice-controllable media system) to enhance theidentification of voice commands from a user, according to embodimentsof the present invention;

FIG. 2 illustrates an example sound compensation system for a digitalvoice-controllable sound generating system, with synchronizationsignaling, according to one embodiment;

FIG. 3 illustrates an example sound compensation system for a digitalvoice-controllable sound generating system, without synchronizationsignaling, according to one embodiment;

FIG. 4 illustrates an example sound compensation system for an analogvoice-controllable sound generating system, according to one embodiment;

FIG. 5 illustrates an example system for synchronizing soundcancelling/countering signals for an analog voice-controllable soundgenerating system, according to one embodiment; and

FIG. 6 illustrates an example method for calibrating a soundcompensation system to compensate for acoustic distortion factors,according to one embodiment.

DETAILED DESCRIPTION

Embodiments of the present disclosure provide systems and methods forimproving voice-based control of a voice-controllable sound generatingsystem (e.g., a voice-controllable entertainment system, TV, stereosystem, smartphone, computer, blue tooth speaker, intercom system, etc.)by cancelling or otherwise countering sounds output by a sound sourceassociated with the voice-controllable sound generating system.

A voice-controllable sound generating system may include a soundgenerating system (SGS) audio source that generates and/or communicatesElectronic SGS audio signals for output by speaker(s) as sound waves,and a voice-control microphone intended to receive voice-based audioincluding voice commands for controlling various functions of the soundgenerating system (e.g., power, operating mode, volume, channel, mediaplayback functions, system settings, etc.). However, the voice-controlmicrophone may also inherently receive SGS-based sound waves output bythe speaker(s) and/or additional sound waves from other sources, whichmay distort or conceal the voice audio and thus complicate the accurateidentification of the voice commands from the audio received at themicrophone.

Thus, some embodiments provide an audio countering system configured toobtain and use the Electronic SGS audio signals communicated by the SGSaudio source (the same electronic audio signals that are communicated toand output by the speakers as SGS-based sound waves) to counter theSGS-based audio received at the voice-control microphone (after beinginfluenced by the surrounding physical environment and/or otherdistortion factors). For example, the audio countering system may usethe electronic SGS audio signals to generate SGS audio-counteringsignals calculated to cancel or otherwise counter the SGS-based audioreceived via the microphone. By cancelling or countering the SGS-basedaudio received via the microphone, the audio countering system caneffectively isolate or enhance a voice-based audio signal from acombined audio signal—including the voice-based audio signal and theSGS-based audio—received at the microphone. The audio countering systemcan then communicate the SGS audio-countered audio signal (including theisolated or enhanced voice-based audio signal) to a voice controlsystem, which may identify voice commands from the SGS audio-counteredaudio signal, and implement the control functions associated with theidentified voice commands.

The audio countering system may be integrated in or separate from thesound generating system, according to the particular embodiment.

Some embodiments include a calibration system configured to modify(calibrate) the Electronic SGS audio signals used by the audiocountering system to cancel or otherwise counter the SGS-based audioreceived via the microphone, in particular to account for the influenceof distortion factors on the SGS-based audio, e.g., time delay andsignal distortion caused by the transmission, reflection, absorption,diffusion, etc. of sound waves. Distortion factors are discussed in moredetail below with reference to FIG. 1.

The calibration system may be configured to output calibration audiosignals via one or more speakers, receive calibration sound waves viathe microphone, and determine calibration data that characterizes theinfluence of distortion factors on the calibration audio signals. Theaudio countering system may use the calibration data for counteringSGS-based audio signals output via speakers and received at thevoice-control microphone, in particular to compensate for distortionfactors that influence the SGS-based audio signals.

The calibration audio source may be the SGS audio source or a separatecalibration audio source. Further, the calibration audio signals maycomprise portion(s) of the SGS-based audio signals generated by the SGSaudio source, or may be integrated in or combined with SGS-based audiosignals generated by the SGS audio source, or may be output separatefrom the output of SGS-based audio signals (e.g., during a definedcalibration process), or otherwise generated and output.

Other embodiments may omit the calibration system.

Various embodiments may be configured for one or more of the followingtypes of sound countering:

1. “Digital-digital sound-countering”: Digital countering of digitalsignals. For example, using electronic SGS-based audio signals generatedby a digital SGS audio source to counter digital SGS-based audio signals(e.g., audio signals received at a voice-control microphone and passedthrough an analog-to-digital (ADC) converter).

2. “Digital-analog sound-countering”: Digital countering of analogsignals. For example, using electronic SGS-based audio signals generatedby a digital SGS audio source to counter analog SGS-based audio signalsreceived at a voice-control microphone.

3. “Analog-analog sound-countering”: Analog countering of analogsignals. For example, using electronic SGS-based audio signals generatedby an analog SGS audio source to counter analog SGS-based audio signalsreceived at a voice-control microphone.

4. “Analog-digital sound-countering”: Analog countering of digitalsignals. For example, using electronic SGS-based audio signals generatedby an analog SGS audio source to counter digital SGS-based audio signals(e.g., audio signals received at a voice-control microphone and passedthrough an analog-to-digital (ADC) converter).

FIG. 1 illustrates an example system 10 for cancelling or otherwisecountering the audio output by a voice-controllable sound generatingsystem (e.g., a voice-controlled media system) to enhance theidentification of voice commands from a user, according to embodimentsof the present invention. The voice-controllable sound generating system(SGS) may comprise a TV, a stereo system, a smartphone, laptop computer,tablet computer, desktop computer, or any other system configured tooutput SGS-based sounds, receive voice-based commands (e.g., via amicrophone), and control one or more aspects of the voice-controllablesound generating system based on the voice-based commands.

As shown in FIG. 1, example system 10 may include a sound generatingsystem (SGS) audio source 12, one or more speakers 14, a microphone 16,a calibration system 20, an audio countering system 22, and a voicecontrol system 24. SGS audio source may comprise any circuits and otherelectronics configured to communicate Electronic SGS audio signals 30(analog or digital) to one or more speakers 14, which may output thesignals 30 as SGS-based sound waves (e.g., into a room or an outdoorenvironment). In some embodiments, SGS audio source may generate andcommunicate the Electronic SGS audio signals 30 (analog or digital) foroutput by speaker(s) 14 as SGS-based sound waves. In other embodiments,SGS audio source may act as a pass-through or relay to communicateElectronic SGS audio signals 30 received from another source tospeaker(s) 14.

The SGS-based sound waves output by speaker(s) 14 may be received bymicrophone 16, converted to electronic signals (analog or digital), andcommunicated to audio countering system 22. The SGS-based audio signalsmay be influenced by one or more distortion factors 40 (e.g., sound wavereflections due to the respective room and/or other physical objects),as discussed below in more detail. Thus, the SGS-based electronicsignals communicated by microphone 16 to audio countering system 22 arereferred to as distorted SGS-based audio signals 32.

Microphone 16 may receive sound waves from any number of audio sources,including SGS-based sound waves discussed above, voice-based sound waves(which may include voice commands) from one or more persons 60, and/orother sound waves from other source(s). Microphone 16 may convert orotherwise process the combined audio signals to generatemicrophone-processed audio signals 36 (analog or digital), which mayinclude (a) distorted SGS-based audio signals 32 corresponding withreceived SGS-based sound waves, and (b) additional audio signals 34including voice-based signals corresponding with received voice-basedsound waves and any other signals corresponding with sound wavesreceived from any other source.

In some embodiments, in addition to communicating Electronic SGS audiosignals 30 for output by speaker(s) 14 as sound waves, SGS audio sourcemay also communicate the same Electronic SGS audio signals 30 to audiocountering system 22. Audio countering system 22 may use the ElectronicSGS audio signals 30 received from SGS audio source to counter thedistorted SGS-based audio signals 32 (based on the same Electronic SGSaudio signals 30) received from microphone 16, to thereby help isolateor enhance voice-based audio signals in the combinedmicrophone-processed audio signals 36.

Audio countering system 22 may include any circuits and/or otherelectronics configured to use Electronic SGS audio signals 30 receivedfrom SGS audio source to cancel or otherwise counter distorted SGS-basedaudio signals 32 received via microphone 16, e.g., to improve thedetection of human voice commands within the additional audio signals 34received via microphone 16. When a person speaks a voice command for aselected function associated with system 10 while speaker(s) 14 areoutputting SGS-based sound waves, microphone 16 receives combined soundwaves and generates a combined audio signal 36 including distortedSGS-based audio signals 32 combined with voice-based audio signals 34and/or other audio signals.

Audio countering system 22 may use electronic SGS audio signals 30(which may or may not be modified/calibrated based on calibration data44 provided by a calibration system 20, as discussed below) to counterthe distorted SGS-based audio signals 32 within the combined audiosignal 36 received from microphone 16, to generate SGS-countered audiosignals 38, which may substantially isolate or enhance the voice-basedaudio signals (digital or analog) from the combined audio signal 36received from microphone 16. Audio countering system 22 may thencommunicate the SGS-countered audio signals 38 to voice control system24, which may analyze the signals 38 to identify one or more voicecommands, and implement or communicate with a respective control systemto implement the identified voice command(s) for selected function(s)associated with system 10.

In some embodiments, audio countering system 22 may modify theElectronic SGS audio signals 30 received from SGS audio source based oncalibration data 44 calculated by calibration system 20, and use themodified (calibrated) Electronic SGS audio signals 46 to cancel orcounter the distorted SGS-based audio signals 32 received via microphone16. For example, audio countering system 22 may use calibration data 44calculated by calibration system 20 to calibrate the Electronic SGSaudio signals 30 to better match the distorted SGS-based audio signals32 to be countered. Thus, the calibration data 44 may be configured toaccount for the influence of distortion factors 40 on the SGS-basedaudio signals, e.g., at any point(s) along the course from thecommunication of the electronic SGS audio signals 30 from SGS audiosource, through the transmission and detection of correspondingSGS-based sound waves (from the system speaker(s) 14, through a physicalenvironment, and to microphone 16), and to the communication ofdistorted SGS-based audio signals 32 to audio countering system 22 forprocessing.

Calibration system 20 may include a calibration audio source 21configured to generate and output calibration audio signals 42 (analogor digital) to speaker(s) 14 for output as sound waves. The calibrationsound waves may be received at microphone 16 and converted to distortedcalibration audio signals 48 (analog or digital), which may becommunicated to calibration system 20 for processing. Distortedcalibration audio signals 48 represent calibration audio signals 42after being influenced or “distorted” by distortion factors 40.Calibration audio signals 42 may include any audio signals(human-audible or human-inaudible) that may be used for such systemcalibration. For example, calibration audio signals 42 may include whitenoise signals output at various volumes to define baseline compensationvariables for spatial aberrations of the listening space and systemperformance.

Calibration system 20 may analyze the distorted calibration audiosignals 48 (e.g., in comparison to the corresponding calibration audiosignals 42) using any suitable algorithms to determine calibration data44 that characterizes the influence of distortion factor(s) 40 on thecalibration audio signals 42. As discussed above, audio countering unit22 may use the calibration data 44 for countering distorted SGS-basedaudio signals 32 communicated by microphone 16. For example, audiocountering unit 22 may apply the calibration data 44 to Electronic SGSaudio signals 30 generated by SGS audio source to generate calibratedSGS-based audio signals 46 that may be are used to counter the distortedSGS-based audio signals 32 received from microphone 16.

In some embodiments, calibration audio signals 42 may include syncsignals, which may be used to measure the delay between (a) electronicSGS audio signals 30 being received at audio countering system 22 fromSGS audio source 12 and (b) the resulting distorted SGS-based audiosignals 32 received at audio countering system 22 via microphone 16,such that the earlier-received electronic SGS audio signals 30 may betime-shifted by the appropriate time period for countering thelater-received distorted SGS-based audio signals 32. Synch signals 42may include any sound signals, at any volume and frequency orfrequencies (human-audible or human-inaudible) that can be detected bymicrophone 16 and identified by calibration system 20 or audiocountering system 22. In some embodiments, sync signals may be outputusing a spread-spectrum approach in the audio domain.

As used herein, “distortion factors” may include any factors thatdistort or otherwise influence any type of audio signals (e.g.,SGS-based audio signals, calibration audio signals, etc.), while in theform of electronic signals (e.g., being communicated between differentcomponents of system 10) or in the form of sound waves (e.g., betweenbeing output by speaker(s) and being received at a microphone). Forexample, distortion factors may include transmission effects on audiosignals from a respective sound source (e.g., SGS audio source orcalibration audio source) to one or more speaker(s), effects imparted bythe speaker(s) themselves, acoustic effects from the surroundingphysical environment (e.g., walls, furniture, or other physicalstructures), acoustic effects from interaction between sound waves frommultiple speakers, effects imparted by the microphone itself (e.g.,associated with collection and/or processing of audio signals), and/orany other distortion factors that may impart an influence on therespective audio signals (e.g., SGS-based audio signals or calibrationaudio signals).

In other embodiments, audio countering system 22 is configured to cancelor counter the distorted SGS-based audio signals 32 without calibratingthe Electronic SGS audio signals 30, and may thus omit calibrationsystem 20.

In addition, audio countering system 22 may include any circuits and/orother electronics configured to provide any one or more of the followingtypes of sound-countering defined above: (1) digital-digitalsound-countering, (2) digital-analog sound-countering, (3) analog-analogsound-countering, and/or (4) analog-digital sound-countering.

FIG. 2 illustrates components of an example sound compensation system10A for a digital voice-controllable sound generating system (SGS), withsynchronization signaling, according to one example embodiment of system10 shown in FIG. 1. System 10A may include a digital SGS audio source 12configured to output digital SGS audio signals 30 for output viaspeaker(s) 14, e.g., after passing through a digital-to-analog converter80 and amplifier 82. In addition, digital SGS audio source 12 maycommunicate the same digital SGS audio signals 30 to a digital audiocountering system 22A, which uses the SGS audio signals 30 to counter(distorted and time-shifted) audio signals received via microphone 16,e.g., as discussed above regarding FIG. 1.

In some embodiments, the system may include multiple digital SGS audiosources simultaneously outputting different sounds. Thus, as shown inFIG. 2, at least one further digital SGS audio source 12′ may generatedigital SGS audio signals for output via speaker(s) 14′, e.g., afterpassing through a digital-to-analog converter 80′ and amplifier 82′. Inaddition, each further digital SGS audio source 12′ may communicate itsdigital SGS audio signals to digital audio countering system 22A, whichmay use the SGS audio signals from the multiple SGS audio sources 12,12′ to counter audio signals received via microphone 16.

System 10A may include a calibration system 20 configured to generateand output synchronization or calibration signals 42, e.g., at definedintervals or based on defined triggering events. In the illustratedexample, calibration system 20 outputs synchronization (“sync”) signals42 via the same speaker(s) 14 as the audio from digital SGS audio source12. Synch signals 42 may include any sound signals (within or outsidethe typical human-audible frequency spectrum) that can be detected bymicrophone 16 and identified by DSP 70.

Digital audio countering system 22A may include an amplifier 60, ananalog-to-digital converter (ADC) 62, a digital signal processor (DSP)70, a memory buffer (e.g., RAM), and one or more compensation algorithms50, along with any other hardware and/or software/firmware componentsfor performing sound countering functionality. DSP 70 may be configuredto receive (a) digital SGS audio signals 30 from digital SGS audiosource 12 (and/or at least one further digital SGS audio source 12′) and(b) microphone-processed audio signals 36 received via microphone 16,e.g., after passing through amp 60 and ADC 62. Microphone-processedaudio signals 36 may include audio signals originating from varioussources and distorted by various distortion factors 40, as discussedabove regarding FIG. 1. For example, microphone-processed audio signals36 may include combination of (a) distorted SGS-based audio signals 32corresponding with digital SGS audio signals 30, (b) distorted syncsignals 48 corresponding with synch signals 42, and/or (c) additionalaudio signals 34 including voice-based sound signals.

DSP 70 may be configured to use digital SGS audio signals 30, which maybe time-shifted or synchronized based on received sync signals 48, tocounter the distorted SGS-based audio signals 32 within the combinedaudio signal 36 received from microphone 16, to generate SGS-counteredaudio signals 38, which may substantially isolate or enhance anyvoice-based audio signals received via microphone 16. DSP 70 may beconfigured to execute any suitable audio compensation algorithms 50 forgenerating SGS-countered audio signals 38. For example, DSP 70 may beconfigured to execute an adaptive filter according to any known orsuitable active noise cancellation algorithms or routines, e.g.,embodied as a known or suitable frequency-domain adaptive filterconfigured to efficiently process discrete-time signals using block andmulti-rate adaptive filtering, e.g., as disclosed in the article“Frequency-domain and multirate adaptive filtering” by J. J. Shynk, IEEESignal Processing Magazine, Vol. 9, Issue 1, January 1992 (pp. 14-37).

In some embodiments, DSP 70 may utilize a memory buffer (e.g., RAM) 72to temporarily store (buffer) digital SGS audio signals 30 from digitalSGS audio source 12 (or multiple digital SGS audio sources 12, 12′), touse the digital SGS audio signals 30 for countering the correspondingdistorted and time-shifted (delayed) SGS-based audio signals 32 receivedvia microphone 16. DSP 70 may determine the appropriate duration fortime-shifting the (earlier-received) digital SGS audio signals 30 tocounter the corresponding (later-received) distorted SGS-based audiosignals 32 based on the timing of distorted sync signals 48 receivedfrom microphone 16 and clock-based timing of the output of synch signals42 from calibration system 20. System 10A may recalibrate thistime-shifting delay at any suitable interval or upon any definedtriggering event).

FIG. 3 illustrates components of an example sound compensation system10B for a digital voice-controllable sound generating system, withoutsynchronization signaling, according to one example embodiment of system10 shown in FIG. 1.

System 10B may be generally similar to system 10A shown in FIG. 2, butmay be configured to synchronize the use of digital SGS audio signals 30received from SGS audio source 30 to counter the resulting(later-received) distorted SGS-based audio signals 32 without the use ofdiscrete sync signals 42.

In some embodiments, the digital audio countering system 22B of system10B may include (a) a first memory buffer (e.g., RAM) 72 for storing orbuffering digital SGS audio signals 30 from digital SGS audio source 12(or multiple digital SGS audio sources 12, 12′) and a second memorybuffer (e.g., RAM) 74 for storing or buffering SGS-based audio signals32 received via microphone 16. DSP 70 may analyze the details of therespective audio signals stored in buffers 72 and 74 to identifyelements of distorted SGS-based audio signals 32 that match thecorresponding digital SGS audio signals 30, and determine the associateddelay between the two signals based on the identified matching signalelements. DSP 70 may then use this determined delay to time-shiftdigital SGS audio signals 30 to counter the corresponding(later-received) distorted SGS-based audio signals 32. System 10A mayperform such time-shift calibration continuously, or at any suitableinterval, or upon any defined triggering event.

FIG. 4 illustrates components of an example sound compensation system10C for an analog voice-controllable sound generating system, accordingto one example embodiment of system 10 shown in FIG. 1. System 10C mayinclude an analog SGS audio source 12 configured to output analog SGSaudio signals 30 for output via speaker(s) 14, e.g., after passingthrough an amplifier 82. In addition, analog SGS audio source 12 maycommunicate the same analog SGS audio signals 30 to an analog audiocountering system 22C, which uses the analog SGS audio signals 30 tocounter (distorted and time-shifted) analog audio signals received viamicrophone 16, e.g., as discussed above regarding FIG. 1.

System 10C may include a calibration system 20 configured to generateand output synchronization or calibration signals 42, e.g., at definedintervals or based on defined triggering events. In the illustratedexample, calibration system 20 outputs synchronization (“sync”) signals42 via the same speaker(s) 14 as the audio from analog SGS audio source12. Synch signals 42 may include any sound signals (within or outsidethe typical human-audible frequency spectrum) that can be detected bymicrophone 16 and identified by DSP 70.

Analog audio countering system 22C may include an amplifier 60 and ananalog compensation circuit 90. Analog audio countering system 22C maybe configured to receive (a) analog SGS audio signals 30 from analog SGSaudio source 12 and (b) analog microphone-processed audio signals 36received via microphone 16, e.g., after passing through amplifier 60.Microphone-processed audio signals 36 may include audio signalsoriginating from various sources and distorted by various distortionfactors 40, as discussed above regarding FIG. 1. For example,microphone-processed audio signals 36 may include combination of (a)distorted SGS-based audio signals 32 corresponding with analog SGS audiosignals 30, (b) distorted sync signals 48 corresponding with synchsignals 42, and/or (c) additional audio signals 34 including voice-basedsound signals.

Analog audio countering system 22C may use analog SGS audio signals 30,which may be time-shifted or synchronized based on received sync signals48, to counter the distorted SGS-based audio signals 32 within thecombined audio signal 36 received from microphone 16, to generateSGS-countered audio signals 38, which may substantially isolate orenhance any voice-based audio signals received via microphone 16. Analogcompensation circuit 90 may include any suitable delay/synchronizationcircuitry for time-shifting (delaying) the application of analog SGSaudio signals 30 received from analog SGS sound source 12 tocancel/counter the corresponding (later-received) distorted SGS-basedaudio signals 32 received via microphone 16, e.g., using a time delaybased on the determined delay between sync signals 42 generated bycalibration source 20 and the resulting distorted sync signals 48received via microphone 16.

FIG. 5 illustrates an example system 100 for synchronizing soundcancelling/countering signals for an analog voice-controllable soundgenerating system, e.g., for use in system 10C shown in FIG. 4,according to one embodiment. System 100 may include the previouslydiscussed components of microphone 16, amplifier 60, and analog SGSaudio source 12, along with spatial aberration correction electronics102, PLL (phase locked loop) and AGC (acoustic gain control) electronics104, phase shift adjust and AGC electronics 106, and an audio sumamplifier 108.

As shown in FIG. 5, analog SGS audio source 12 outputs analog SGS audiosignals to both (a) speaker(s) 14 and (b) the sound countering system,in particular to (i) phase shift adjust and AGC electronics 106 and (ii)PLL and AGC electronics 104. Microphone 16 receives sound waves(originated from the SGS audio signals, any voice-based signals, and/orother audio sources) and generate corresponding analog audio signals,which are passed through amplifier 60 and then processed by spatialaberration correction electronics 102. The spatial-aberration-correctedsignals are then passed to both (a) PLL and AGC electronics 104 and (b)audio sum amplifier 108.

As shown, PLL and AGC electronics 104 receives (a) analog SGS audiosignals from analog SGS audio source 12 and (b) microphone-received andspatial-aberration-corrected signals from spatial aberration correctionelectronics 102, and process such signals to generate phase errorvoltage and gain adjust signals 105, which are passed to phase shiftadjust and AGC electronics 106. Phase shift adjust and AGC electronics106 phase-shifts the analog SGS audio signals received from analog SGSaudio source 12 based on the phase error voltage and gain adjust signals105 received from PLL and AGC electronics 104, and forwards thephase-shifted SGS audio signals to audio sum amplifier 108.

Audio sum amplifier 108 sums (a) the microphone-received signalsreceived from spatial aberration correction electronics 102 with (b) thephase-shifted SGS audio signals received from phase shift adjust and AGCelectronics 106, which cancels or counters the portion of themicrophone-received signals corresponding with the SGS audio signalsoutput via speaker(s) 14 and received via microphone 16, to therebyenhance the remaining portions of the microphone-received signals, e.g.,including voice-based signals. Thus, the resulting SGS-countered audiosignals 38 may enhance any voice commands included in themicrophone-received signals, and these enhanced signals 38 may then bepassed to a voice control system 24 for processing.

In some embodiments, a synchronization signal can be used to create thePPL error voltage and gain control signals 105. For example, analog SGSaudio source 12 may include a synchronization signals generator 120configured to generate and output synchronizations signals. Further, insome embodiments, the spatial aberration correction electronics 102 maybe omitted, thus providing a fully analog system.

FIG. 6 illustrates an example method 200 for calibrating a soundcompensation system to compensate for acoustic distortion factors,according to one embodiment. In some embodiments, method 200 may beimplemented by calibration system 20 shown in FIGS. 1-3, for example.With reference to FIG. 1, at 202 calibration system 20 may outputcalibration signals 42 via speaker(s) 14 at a first (e.g., low) volume.In one embodiment the calibration signals 42 may comprise white noisesignals. At 204, calibration system 20 may receive and analyze distortedcalibration audio signals 48 via microphone 16 and determinecompensation values representing the distortion of the received audiosignals, e.g., as imparted by any distortion factors 40.

At 206, calibration system 20 may determine whether a predefined maximumvolume for the calibration signals has been reached. If not, asindicated at 208, calibration system 20 may output calibration signals42 via speaker(s) 14 at a next (e.g., higher) volume level, anddetermine resulting compensation values based on distorted calibrationaudio signals 48 received via microphone 16. When calibration system 20determines at 206 that the predefined maximum volume for the calibrationsignals has been reached, all compensation values are acquired and themethod may end, as indicated at 210.

1. A system for enhanced processing of voice-based signals in avoice-controllable sound-generating system (SGS), comprising: an SGSaudio source configured to communicate electronic SGS audio signals toboth (a) one or more speakers and (b) an audio countering system;wherein the one or more speakers are configured to: receive theelectronic SGS audio signals communicated by the SGS audio source; andoutput SGS sound waves corresponding with the electronic SGS audiosignals; a microphone configured to, during a period of system-basedaudio output, detect sound waves and output microphone-processed audiosignals representing the detected sound waves, the microphone-processedaudio signals including: (a) distorted SGS audio signals correspondingwith the electronic SGS audio signals output by the one or more speakersas SGS sound waves and detected by the microphone; and (b) additionalaudio signals resulting from sound waves originated from one or moresources other than the SGS audio source and detected by the microphone;and wherein the audio countering system is configured to: receive theelectronic SGS audio signals from the SGS audio source; receive themicrophone-processed audio signals representing the detected soundwaves; use the electronic SGS audio signals received from the SGS audiosource to generate SGS-countered audio signals that cancel or counterthe distorted SGS audio signals included in the microphone-processedaudio signals and thereby enhance the additional audio signals.
 2. Thesystem of claim 1, wherein: the additional audio signals includevoice-based signals including a voice command from a human; and thesystem further comprises a voice-control system configured to: receivethe SGS-countered audio signals from the audio countering system;identify the voice command from the SGS-countered audio signals; andcontrol at least one function associated with the SGS based on the voicecommand identified from the SGS-countered audio signals.
 3. The systemof claim 1, further comprising a calibration system configured tocommunicate electronic calibration signals for output by the one or morespeakers as calibration sound waves; wherein the microphone isconfigured to, during an audio calibration period, detect calibrationsound waves and output microphone-processed calibration audio signalsrepresenting the detected calibration sound waves, themicrophone-processed calibration audio signals including distortedcalibration audio signals corresponding with the electronic calibrationsignals output by the one or more speakers as calibration sound wavesand detected by the microphone; and wherein the calibration system isfurther configured to determine calibration data based on (a) theelectronic calibration signals and (b) the distorted calibration audiosignals in the microphone-processed audio signals.
 4. The system ofclaim 3, wherein the audio countering system is configured to: receivethe calibration data from the calibration system; use the calibrationdata to calibrate the electronic SGS audio signals communicated by theSGS audio source, to define calibrated electronic SGS audio signals; usethe calibrated electronic SGS audio signals to counter the distorted SGSaudio signals in the microphone-processed audio signals to generate theSGS-countered audio signals.
 5. The system of claim 3, wherein: thedistorted SGS audio signals in the microphone-processed audio signalsrepresent an distorted version of the electronic SGS audio signals,resulting from the influence of distortion factors acting at least onthe SGS sound waves output by the one or more speakers; and thecalibration data characterizes the influence of distortion factors. 6.The system of claim 3, wherein the audio calibration period occursduring the period of system-based audio output.
 7. The system of claim3, wherein the audio calibration period occurs separate from the periodof system-based audio output.
 8. The system of claim 1, wherein theelectronic SGS audio signals communicated by the SGS audio sourcecomprise digital SGS audio signals.
 9. The system of claim 8, wherein:the microphone-processed audio signals comprise digital signals; and theaudio countering system is configured to provide digital processing ofthe digital SGS audio signals and digital microphone-processed audiosignals to generate the SGS-countered audio signals
 10. The system ofclaim 1, wherein the electronic SGS audio signals communicated by theSGS audio source comprise analog SGS audio signals.
 11. The system ofclaim 10, wherein: the microphone-processed audio signals compriseanalog signals; and the audio countering system is configured to provideanalog processing of the analog SGS audio signals and analogmicrophone-processed audio signals to generate the SGS-countered audiosignals
 12. A method for enhanced processing of voice-based signals in avoice-controllable sound-generating system (SGS), the method comprising:communicating, from an SGS audio source configured, electronic SGS audiosignals to both (a) one or more speakers for output of SGS sound wavescorresponding with the electronic SGS audio signals, and (b) an audiocountering system; detecting, at a microphone, sound waves including:(a) SGS sound waves corresponding with the electronic SGS audio signalsoutput by the one or more speakers; and (b) additional sound wavesoriginated from one or more sources other than the SGS audio source; andgenerating, by the microphone, microphone-processed audio signalsrepresenting the detected sound waves, the microphone-processed audiosignals including: (a) distorted SGS audio signals corresponding withthe detected SGS sound waves; and (b) additional audio signalscorresponding with the detected additional sound waves; and receiving,at the audio countering system: (a) the electronic SGS audio signalscommunicated from the SGS audio source; and (b) the microphone-processedaudio signals representing the detected sound waves; and processing, bythe audio countering system, the electronic SGS audio signals receivedfrom the SGS audio source to generate SGS-countered audio signals thatcancel or counter the distorted SGS audio signals included in themicrophone-processed audio signals and thereby enhance the additionalaudio signals.
 13. The method of claim 12, wherein: the additional audiosignals include voice-based signals including a voice command from ahuman; and the method further comprises: receiving, at a voice-controlsystem, the SGS-countered audio signals from the audio counteringsystem; identifying, by the voice-control system, the voice command fromthe SGS-countered audio signals; and controlling, by the voice-controlsystem, at least one function associated with the SGS based on the voicecommand identified from the SGS-countered audio signals.
 14. The methodof claim 12, further comprising: communicating electronic calibrationsignals for output by the one or more speakers as calibration soundwaves; detecting, by the microphone, calibration sound waves and outputmicrophone-processed calibration audio signals representing the detectedcalibration sound waves, the microphone-processed calibration audiosignals including distorted calibration audio signals corresponding withthe electronic calibration signals output by the one or more speakers ascalibration sound waves and detected by the microphone; and determining,by the calibration system, calibration data based on (a) the electroniccalibration signals and (b) the distorted calibration audio signals inthe microphone-processed audio signals.
 15. The method of claim 14,further comprising: using the calibration data to calibrate theelectronic SGS audio signals communicated by the SGS audio source, todefine calibrated electronic SGS audio signals; and using the calibratedelectronic SGS audio signals to counter the distorted SGS audio signalsin the microphone-processed audio signals to generate the SGS-counteredaudio signals.
 16. The method of claim 14, wherein: the distorted SGSaudio signals in the microphone-processed audio signals represent andistorted version of the electronic SGS audio signals, resulting fromthe influence of distortion factors acting at least on the SGS soundwaves output by the one or more speakers; and the calibration datacharacterizes the influence of distortion factors.
 17. The method ofclaim 12, wherein the electronic SGS audio signals communicated by theSGS audio source comprise digital SGS audio signals.
 18. The method ofclaim 17, wherein: the microphone-processed audio signals comprisedigital signals; and the audio countering system performs digitalprocessing of the digital SGS audio signals and digitalmicrophone-processed audio signals to generate the SGS-countered audiosignals
 19. The method of claim 12, wherein the electronic SGS audiosignals communicated by the SGS audio source comprise analog SGS audiosignals.
 20. The method of claim 19, wherein: the microphone-processedaudio signals comprise analog signals; and the audio countering systemperforms analog processing of the analog SGS audio signals and analogmicrophone-processed audio signals to generate the SGS-countered audiosignals.